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<110> APPLICANT: Boyce Thompson Institute for Plant Research 

Pioneer Hi-Bred International, Inc. 

May, Gregory D 

Baszczynski, Christopher L 

Zhu, Tong 

Kipp, Peter B 

Mahajan, Pramod B 
<120> TITLE OF INVENTION: PLANT MSH2 SEQUENCES AND METHODS OF USE 
<130> FILE REFERENCE: 42960/257618 
<140> CURRENT APPLICATION NUMBER: PCT/US02/40887 
<141> CURRENT FILING DATE: 2002-12-20 
<150> PRIOR APPLICATION NUMBER: US 10/029,065 
<151> PRIOR FILING DATE: 2001-12-20 
<160> NUMBER OF SEQ ID NOS : 42 
<170> SOFTWARE: Patent In version 3.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 3033 
<212> TYPE: DNA 

<213> ORGANISM: Nicotiana tabacum 
<220> FEATURE: 
<221> NAME/KEY: CDS 
<222> LOCATION: (22).. (2838) 
<223> OTHER INFORMATION: 
<400> 1 

ataaaggtta aagaaaaaaa a 



ENTERED 



aag 
Lys 
10 
tea 



atg aat gaa aat ttg gag gaa cag age 
Met Asn Glu Asn Leu Glu Glu Gin Ser 
1 5 
ctt ccc gag ctt aaa ctg gat get aag caa get caa gga ttt etc 
Leu Pro Glu Leu Lys Leu Asp Ala Lys Gin Ala Gin Gly Phe Leu Ser 

15 20 25 

ttc ttc aaa acc ctg ccc aag gac cct agg gca gtt cgc etc ttt gat 
Phe Phe Lys Thr Leu Pro Lys Asp Pro Arg Ala Val Arg Leu Phe Asp 

30 35 40 

cgt egg gac tat tat aca tct cat gga gat gat gca act ttc att gca 
Arg Arg Asp Tyr Tyr Thr Ser His Gly Asp Asp Ala Thr Phe He Ala 

45 50 55 

gag aca tat tac cac aca aca act gcg tta cga cag ttg ggt aat aga 
Glu Thr Tyr Tyr His Thr Thr Thr Ala Leu Arg Gin Leu Gly Asn Arg 

60 65 70 

get gat gec ctt tec agt gtt agt gtg agt aga aac atg ttt gaa aca 
Ala Asp Ala Leu Ser Ser Val Ser Val Ser Arg Asn Met Phe Glu Thr 
75 80 85 90 

ata get cgt gac att etc ttg gag aga atg gac cgt act ctt gaa eta 



51 



99 



147 



195 



243 



291 



339 
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58 He Ala Arg Asp He Leu Leu Glu Arg Met Asp Arg Thr Leu Glu Leu 

59 95 100 105 

61 tat gag ggc agt ggt tea aac tgg aga ctg gta aaa agt gga acc cca 387 

62 Tyr Glu Gly Ser Gly Ser Asn Trp Arg Leu Val Lys Ser Gly Thr Pro 

63 110 115 " 120 

65* ggg aat ctt gga agt ttt gag gat att ctg ttt get aat aat gaa atg 435 

66 Gly Asn Leu Gly Ser Phe Glu Asp He Leu Phe Ala Asn Asn Glu Met 

67 125 130 135 

69 caa aat tct ccg gtg att get get ctt get cca aac ttc ggt cag aat 483 

70 Gin Asn Ser Pro Val He Ala Ala Leu Ala Pro Asn Phe Gly Gin Asn 

71 140 145 150 

73 gga tgt gaa gtt ggc tta ggc tat gtt gat att act aag aga gtc ctt 531 

74 Gly Cys Glu Val Gly Leu Gly Tyr Val Asp He Thr Lys Arg Val Leu 

75 155 160 165 ~ 170 

77 ggt tta aca gaa ttt eta gat gat age cac ttc aca aat ttg gag tct 579 

78 Gly Leu Thr Glu Phe Leu Asp Asp. Ser His Phe Thr Asn Leu Glu Ser 

79 175 180 185 

81 get ttg gtt get ctt ggt tgc aga gaa tgt ctt gta cca gcg gag act 627 

82 Ala Leu Val Ala Leu Gly Cys Arg Glu Cys Leu Val Pro Ala Glu Thr 

83 190 195 200 

85 ggc aaa tec agt gaa tac agg cct atg ttt gat gca ata tct aga tgc 675 
8 6 Gly Lys Ser Ser Glu Tyr Arg Pro Met Phe Asp Ala He Ser Arg Cys 
87 205 210 215 

89 ggc gtg atg gta act gaa aga aag aaa act gaa ttt aaa ggg aga gat 723 

90 Gly Val Met Val Thr Glu Arg Lys Lys Thr Glu Phe Lys Gly Arg Asp 

91 220 225 230 

93 ttg gta cag gat ctt ggt agg etc gtc aag ggt tea gta gaa cct gtt 771 

94 Leu Val Gin Asp Leu Gly Arg Leu Val Lys Gly Ser Val Glu Pro Val 

95 235 240 245 250 

97 cga gat ttg gtc tct ggg ttc gaa tgt gca tea ggc get ttg ggg tgc 819 

98 Arg Asp Leu Val Ser Gly Phe Glu Cys Ala Ser Gly Ala Leu Gly Cys 

99 255 260 265 

101 ata ctt tct tat gca gaa eta ctt gcg gat gag age aac tat gga aac 8 67 

102 He Leu Ser Tyr Ala Glu Leu Leu Ala Asp Glu Ser Asn Tyr Gly Asn 

103 270 275 280 

105 tat aca gtc aaa caa tac aac etc aat agt tac atg aga tta gat tct 915 

106 Tyr Thr Val Lys Gin Tyr Asn Leu Asn Ser Tyr Met Arg Leu Asp Ser 

107 285 290 295 

109 get get atg aga gca ctg aat gtt atg gag age aaa tea gat get aat 963 

110 Ala Ala Met Arg Ala Leu Asn Val Met Glu Ser Lys Ser Asp Ala Asn 
HI 300 305 310 

113 aaa aat ttt age ttg ttc ggt ctg atg aat aga acg tgt act get gga 1011 

114 Lys Asn Phe Ser Leu Phe Gly Leu Met Asn Arg Thr Cys Thr Ala Gly 

115 315 320 325 ' 330 

117 atg ggt aaa agg tta ttg cac atg tgg ctg aag caa cct tta eta gat 1059 

118 Met Gly Lys Arg Leu Leu His Met Trp Leu Lys Gin Pro Leu Leu Asp 
H9 335 340 345 

121 gta gaa gag att aac tgt agg ctg gat tta gtt caa tea ttc gtg gag 1107 

122 Val Glu Glu He Asn Cys Arg Leu Asp Leu Val Gin Ser Phe Val Glu 
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149 

150 

151 
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165 

166 

167 

169 

170 

171 

173 

174 

175 

177 

178 

179 

181 

182 

183 

185 

186 

187 



gat get 
Asp Ala 

gat att 
Asp lie 
380 
gtg cac 
"Val His 
395 

aaa agt 
Lys Ser 

gaa agg 
Glu Arg 

aat aag 
Asn Lys 



gag 
Glu 

get 
Ala 
475 
cac 
His 



aat 
Asn 
460 
ctg 
Leu 

aaa 
Lys 



aaa eta 
Lys Leu 

aaa gaa 
Lys Glu 



gcg 
Ala 
365 
gag 
Glu 

gtt 
Val 

gtt 
Val 

tat 
Tyr 

ttc 
Phe 
445 
gga 
Gly 

aag 
Lys 

caa 
Gin 

gat 
Asp 



350 
ctt 
Leu 

egg 
Arg 

gta 
Val 

ttg 
Leu 

att 
He 
430 
ata 
He 

gaa 
Glu 

gat 
Asp 

act 
Thr 

aaa 
Lys 
510 



cgc caa 
Arg Gin 

ctg aca 
Leu Thr 

aaa etc 
Lys Leu 
400 
gaa cgt 
Glu Arg 
415 

gat tct 
Asp Ser 

ggt ctt 
Gly Leu 

tac atg 
Tyr Met 

gag caa 
Glu Gin 
480 
gec aat 
Ala Asn 
495 

gaa aca 
Glu Thr 



gat ttg 
Asp Leu 
370 
cac aat 
His Asn 
385 

tat cag 
Tyr Gin 

cat gat 
His Asp 

eta gag 
Leu Glu 

gtg gaa 
Val Glu 
450 
att tct 
He Ser 
465 

gag aca 
Glu Thr 

gat ctt 
Asp Leu 

caa ttt 
Gin Phe 



355 

agg cag cat 
Arg Gin His 

ctt gag agg 
Leu Glu Arg 

tea agt acc 
Ser Ser Thr 
405 

ggg caa ttt 
Gly Gin Phe 

420 
aaa tgg agt 
Lys Trp Ser 
435 

act tct gtt 
Thr Ser Val 

tct gca tat 
Ser Ala Tyr 



etc 
Leu 

aaa 
Lys 
555 
cag 
Gin 



gaa 
Glu 
540 
eta 
Leu 

aaa 
Lys 



gag gtg 
Glu Val 

ctg agt 
Leu Ser 



gaa cca aaa 
Glu Pro Lys 
525 

aca cgt aag 
Thr Arg Lys 

gga gat cag 
Gly Asp Gin 

gaa ttg gta 
Glu Leu Val 
575 

ttt gca ggt 
Phe Ala Gly 

590 
ttt gcg gat 
Phe Ala Asp 
605 



gtc agg aag 
Val Arg Lys 
530 

gat ggg gta 
Asp Gly Val 

545 
ttc cag aag 
Phe Gin Lys 
560 

get cgt gta 
Ala Arg Val 

ata get ggt 
He Ala Gly 

ttg get gee 
Leu Ala Ala 
610 



ttg 
Leu 

gat 
Asp 

gga 
Gly 
515 
cag 
Gin 



gag cga 
Glu Arg 
485 
eta cct 
Leu Pro 
500 

cac gtc 
His Val 

eta aat 
Leu Asn 



aag ttc acc 
Lys Phe Thr 



att 
lie 

gtt 
Val 

gta 
Val 
595 
agt 
Ser 



gta 
Val 

caa 
Gin 
580 
ctt 
Leu 



gag 
Glu 
565 
aca 
Thr 

get 
Ala 



tgc cca 
Cys Pro 



ctg aaa 
Leu Lys 
375 
aaa aga 
Lys Arg 
390 

aga gta 
Arg Val 

gca aca 
Ala Thr 

gat gat 
Asp Asp 

gac ctt 
Asp Leu 
455 
gac cca 
Asp Pro 
470 

caa att 
Gin He 

att gat 
He Asp 

ttc aga 
Phe Arg 

tct cac 
Ser His 
535 
tat aca 
Tyr Thr 
550 

gag tac 
Glu Tyr 

get gcg 
Ala Ala 

gag ttg 
Glu Leu 

act ccc 
Thr Pro 
615 



360 

aga att tea 
Arg He Ser 

gee agt tta 
Ala Ser Leu 



cca 
Pro 

etc 
Leu 

aat 
Asn 
440 
gat 
Asp 



tat ate 
Tyr He 
410 
ate agg 
He Arg 
4 25 

cac ctg 
His Leu 

caa ctt 
Gin Leu 



aat tta tct 
Asn Leu Ser 



cat 
His 

aag 
Lys 

att 
He 
520 
tac 
Tyr 



aat ttg 
Asn Leu 
490 
tea ctt 
Ser Leu 
505 

acc aag 
Thr Lys 

att gtt 
He Val 



aaa etc aaa 
Lys Leu Lys 

aaa age tgt 
Lys Ser Cys 
570 

agt ttc tec 
Ser Phe Ser 
585 

gat gtg tta 
Asp Val Leu 
600 

tac aca aga 
Tyr Thr Arg 



1155 



1203 



1251 



1299 



1347 



1395 



1443 



1491 



1539 



1587 



1635 



1683 



1731 



1779 



1827 



1875 
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189 cca aat ate agt cca cca gat aca gga gat att ata ctt gaa ggg tgt 1923 

190 Pro Asn He Ser Pro Pro Asp Thr Gly Asp He He Leu Glu Gly Cys 

191 620 625 630 

193 agg cat cct tgt gtg gaa get caa gat tgg gtt aac tec att cct aat 1971 

194 Arg His Pro Cys Val Glu Ala Gin Asp Trp Val Asn Ser He Pro Asn 

195 635 640 645 650 

197 gac tgt aga eta gtt agg gga gag agt tgg ttt cag att ate aca ggc 2019 

198 Asp Cys Arg Leu Val Arg Gly Glu Ser Trp Phe Gin He He Thr Gly 
199- 655 660 665 

201 cct aac atg ggt gga aag teg acc tac att egg cag gtt ggt gtg aat 2067 

202 Pro Asn Met Gly Gly Lys Ser Thr Tyr He Arg Gin Val Gly Val Asn 

203 670 675 = 680 

205 gtc ctg atg gec caa gtt ggc teg ttt gtt cca tgt gac aat get acc 2115 

206 Val Leu Met Ala Gin Val Gly Ser Phe Val Pro Cys Asp Asn Ala Thr 

207 685 690 695 

209 att tct att cgt gat tgt att ttt §ct cgt gtt ggc get gga gat tgc 2163 

210 lie Ser He Arg Asp Cys lie Phe Ala Arg Val Gly Ala Gly Asp Cys 

211 700 705 710 

213 cag ctg aga gga gtt tct act ttt atg caa gag atg ctt gag act gca 2211 

214 Gin Leu Arg Gly Val Ser Thr Phe Met Gin Glu Met Leu Glu Thr Ala 

215 715 720 725 730 

217 teg ate ttg aaa gga get act gat aga tea ttg att ata att gat gag 2259 

218 Ser lie Leu Lys Gly Ala Thr Asp Arg Ser Leu He He He Asp Glu 

219 735 740 745 

221 ttg ggc cgt ggg aca tea acc tac gat ggc ttt ggt tta get tgg get 2307 

222 Leu Gly Arg Gly Thr Ser Thr Tyr Asp Gly Phe Gly Leu Ala Trp Ala 

223 750 755 ~ 760 

225 att tgt gag cac att gtt gaa gaa att aaa gca cca aca ttg ttt gee 2355 

226 He Cys Glu His He Val Glu Glu He Lys Ala Pro Thr Leu Phe Ala 

227 765 770 775 

229 act cac ttt cat gag ctg act gca tta gee aac aag aat gga gac aat 2403 

230 Thr His Phe His Glu Leu Thr Ala Leu Ala Asn Lys Asn Gly Asp Asn 

231 780 785 790 

233 gga cat aag aaa aat get ggg ata gca aat ttt cat gtt ttt gca cac 2451 

234 Gly His Lys Lys Asn Ala Gly He Ala Asn Phe His Val Phe Ala His 

235 795 800 805 810 

237 att gac cct tct aat cgc aag eta act atg ctt tac aag gtt cac cca 2499 

238 He Asp Pro Ser Asn Arg Lys Leu Thr Met Leu Tyr Lys Val His Pro 

239 815 820 825 

241 ggt get tgt gat cag agt ttt ggt att cat gtt get gaa ttt gca aat 2547 

242 Gly Ala Cys Asp Gin Ser Phe Gly lie His Val Ala Glu Phe Ala Asn 

243 830 835 840 

245 ttt cca ccg agt gtt gtg get ctg get aga gaa aag gca tct gag ttg 2595 
24 6 Phe Pro Pro Ser Val Val Ala Leu Ala Arg Glu Lys Ala Ser Glu Leu 
247 845 850 855 

249 gag gat ttc tct cct att gee ata att cca aat gac att aaa gag gca 2643 

250 Glu Asp Phe Ser Pro lie Ala He He Pro Asn Asp He Lys Glu Ala 

251 860 865 870 

253 get tea aaa egg aag aga gaa ttt gac cgc cat gac gtg tct aga ggt 2691 
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254 Ala Ser Lys Arg Lys Arg Glu Phe Asp Arg His Asp Val Ser Arg Gly 

255 875 880 885 " 890 

257 act gcc aga get egg caa ttc tta cag gat ttc get cag ttg cca ctg 2739 

258 Thr Ala Arg Ala Arg Gin Phe Leu Gin Asp Phe Ala Gin Leu Pro Leu 

259 895 900 905 

261 gat aag atg gat cca aac gtg gtc agg caa aag ttg age aaa atg aaa 2787 
2 62 Asp Lys Met Asp Pro Asn Val Val Arg Gin Lys Leu Ser Lys Met Lys 
263 ^ 910 915 920 

265 *acc gac ctg gag agg gat gca gtt gac tct cac tgg ctt cag caa ttc 2835 
2 66 Thr Asp Leu Glu Arg Asp Ala Val Asp Ser His Trp Leu Gin Gin Phe 
267 925 930 935 

269 ttt taattcttca gattagaact atcttctatt ctgtgaagct tgggggggaa 2888 

270 Phe 

273 tgatacttat gggttttgtg gatataactt agectatctg taaactttca tttaaatcct 2948 
275 taccccaaac atgattctct gtaatcaggg gacttttgta tgcattctgt gttaatagta 3008 
277 agegttatet tatatggtca aaaaa ' 3033 

280 <210> SEQ ID NO: 2 

281 <211> LENGTH: 939 

282 <212> TYPE: PRT 

283 <213> ORGANISM: Nicotiana tabacum 
285 <400> SEQUENCE: 2 

287 Met Asn Glu Asn Leu Glu Glu Gin Ser Lys Leu Pro Glu Leu Lys Leu 

288 15 10 15 

291 Asp Ala Lys Gin Ala Gin Gly Phe Leu Ser Phe Phe Lys Thr Leu Pro 

292 20 25 30 

295 Lys Asp Pro Arg Ala Val Arg Leu Phe Asp Arg Arg Asp Tyr Tyr Thr 

296 35 40 ~ 45 

299 Ser His Gly Asp Asp Ala Thr Phe He Ala Glu Thr Tyr Tyr His Thr 

300 50 55 60 

303 Thr Thr Ala Leu Arg Gin Leu Gly Asn Arg Ala Asp Ala Leu Ser Ser 

304 65 70 75 80 

307 Val Ser Val Ser Arg Asn Met Phe Glu Thr He Ala Arg Asp He Leu 

308 85 90 95 

311 Leu Glu Arg Met Asp Arg Thr Leu Glu Leu Tyr Glu Gly Ser Gly Ser 

312 100 105 HO 

315 Asn Trp Arg Leu Val Lys Ser Gly Thr Pro Gly Asn Leu Gly Ser Phe 

316 115 120 125 

319 Glu Asp He Leu Phe Ala Asn Asn Glu Met Gin Asn Ser Pro Val He 

320 130 135 140 

323 Ala Ala Leu Ala Pro Asn Phe Gly Gin Asn Gly Cys Glu Val Gly Leu 

324 145 150 155 160 

327 Gly Tyr Val Asp He Thr Lys Arg Val Leu Gly Leu Thr Glu Phe Leu 

328 165 170 175 

331 Asp Asp Ser His Phe Thr Asn Leu Glu Ser Ala Leu Val Ala Leu Gly 

332 180 185 190 

335 Cys Arg Glu Cys Leu Val Pro Ala Glu Thr Gly Lys Ser Ser Glu Tyr 

336 195 200 205 

339 Arg Pro Met Phe Asp Ala He Ser Arg Cys Gly Val Met Val Thr Glu 

340 210 215 * 220 



file://C:\CRF4\Outhold\VsrPU40887.htm 



1/17/03 



Page 6 of 8 



RAW SEQUENCE LISTING ERROR SUMMARY DATE: 01/17/2003 

PATENT APPLICATION: PCT/US02/40887 TIME: 15:17:44 

Input Set : A:\257618.txt 

Output Set: N:\CRF4\01172003\PU40887.raw 

Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:5r N Pos . 1,2 

Seq#:6; N Pos. 1,2,141 

Seq#:7; N Pos. 1,2 

Seq#:8; Pos. 1,2, 161 

Seq#:9; N Pos. 165,166 

Seq#:10; N Pos. 1,2 

Seq#:ll; N Pos. 1,2, 157, 158 

Seq#:12; N Pos. 1,2 

Seq#:13; N Pos. 222 

Seq#:21; N Pos. 11,12,13,14,15,16 

Seq#:29; N Pos. 3, 6, 9 

Seq#:30; N Pos. 9, 12 
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